Mining Dense Structures by Enumerating Weighted and Multi-level Pseudo-Bicliques

نویسندگان

  • Zareen Alamgir
  • Saira Karim
  • Syed Husnine
چکیده

Pseudo-bicliques model various problems encountered in bio-informatics, data mining and networks. They relax the rigid connectivity requirement of bicliques to cater missing and noisy data. In this paper, we consider the weighted density based model of pseudo-biclique. This model defines pseudo-biclique as a bipartite subgraph such that the ratio of the number of its edges to the number of edges in biclique of same size is no less than a given threshold value. The weighted model of pseudo-bicliques better fits the real-world situations and give much more flexibility to researchers. We propose an algorithm based on reverse search to generate all weighted pseudo-bicliques in a given graph. This is computationally non-trivial task as simple straightforward branch-and-bound and back-tracking schemes involve an NP complete problem. Furthermore, we proposed the use of multi-level pseudo-bicliques to discover knowledge at multiple levels and extend our algorithm to enumerate all multi-level pseudo-bicliques. We introduce various enhancements to our algorithm based on the structure of pseudo-bicliques and underlying bipartite graph. We evaluated the performance of our algorithms on random graphs and real-world problems. The results are quite promising and show that average linear time is incurred to generate each pseudo-biclique.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Efficient Algorithm for Enumerating Pseudo Cliques

The problem of finding dense structures in a given graph is quite basic in informatics including data mining and data engineering. Clique is a popular model to represent dense structures, and widely used because of its simplicity and ease in handling. Pseudo cliques are natural extension of cliques which are subgraphs obtained by removing small number of edges from cliques. We here define a pse...

متن کامل

Efficient Mining of Large Maximal Bicliques

Many real world applications rely on the discovery of maximal biclique subgraphs (complete bipartite subgraphs). However, existing algorithms for enumerating maximal bicliques are not very efficient in practice. In this paper, we propose an efficient algorithm to mine large maximal biclique subgraphs from undirected graphs. Our algorithm uses a divide-and-conquer approach. It effectively uses t...

متن کامل

Incremental Maintenance of Maximal Bicliques in a Dynamic Bipartite Graph

We consider incremental maintenance of maximal bicliques from a dynamic bipartite graph that changes over time due to the addition of edges. When new edges are added to the graph, we seek to enumerate the change in the set of maximal bicliques, without enumerating the set of maximal bicliques that remain unaffected. The challenge in an efficient algorithm is to enumerate the change without expl...

متن کامل

A Change-Sensitive Algorithm for Maintaining Maximal Bicliques in a Dynamic Bipartite Graph

We consider the maintenance of maximal bicliques from a dynamic bipartite graph that changes over time due to the addition or deletion of edges. When the set of edges in a graph changes, we are interested in knowing the change in the set of maximal bicliques (the “change”), rather than in knowing the set of maximal bicliques that remain unaffected. The challenge in an efficient algorithm is to ...

متن کامل

Mining maximal quasi-bicliques: Novel algorithm and applications in the stock market and protein networks

Several real world applications require mining of bicliques, as they represent correlated pairs of data clusters. However, the mining quality is adversely affected by missing and noisy data. Moreover, some applications only require strong interactions between data members of the pairs, but bicliques are pairs that display complete interactions. We address these two limitations by proposing maxi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012